Corpus: lit-lt_web_2020_10K

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of distinct word N-grams (N = 1...5) at sentence endings, counted over the first K sentences

K           # of words   # of bigrams  # of trigrams   # of 4-grams   # of 5-grams
100                 96             98             99             99             99
1000               929            995            999            999            999
10000             7357           9734           9964           9978           9982
100000            7358           9735           9965           9979           9983
1000000           7358           9735           9965           9979           9983
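
The counts in such a table could be reproduced along the following lines. This is a minimal sketch, not the method used to generate the table above: it assumes whitespace tokenisation, treats sentence-final punctuation as an ordinary token, and skips sentences shorter than N words. The function name `ending_ngram_counts` and the example sentences are hypothetical.

```python
def ending_ngram_counts(sentences, k, max_n=5):
    """Count distinct word n-grams (n = 1..max_n) that end the first k sentences."""
    # One set of distinct n-grams per n.
    seen = {n: set() for n in range(1, max_n + 1)}
    for sentence in sentences[:k]:
        tokens = sentence.split()  # assumption: tokens are separated by whitespace
        for n in range(1, max_n + 1):
            if len(tokens) >= n:  # assumption: too-short sentences are skipped
                seen[n].add(tuple(tokens[-n:]))
    return [len(seen[n]) for n in range(1, max_n + 1)]


# Hypothetical input, not taken from the corpus.
example_sentences = [
    "the cat sat on the mat .",
    "a dog sat on the mat .",
    "it rained .",
]

for k in (1, 3):
    print(k, ending_ngram_counts(example_sentences, k))
```

Under these assumptions each count can never exceed K, and it stops growing once K reaches the number of sentences actually available.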


Zipf's diagram for sentence endings


[Figure: Gnuplot diagram]
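
The data behind a Zipf diagram of this kind is a rank-frequency list. The sketch below assumes the diagram is based on the last word of each sentence and on whitespace tokenisation; neither is stated in this report, and the function name `zipf_rank_frequency` and the example sentences are hypothetical.

```python
from collections import Counter


def zipf_rank_frequency(sentences):
    """Return (rank, frequency) pairs for sentence-final words, most frequent first."""
    endings = []
    for sentence in sentences:
        tokens = sentence.split()  # assumption: whitespace tokenisation
        if tokens:  # ignore empty lines
            endings.append(tokens[-1])
    frequencies = sorted(Counter(endings).values(), reverse=True)
    # Rank 1 is the most frequent ending.
    return list(enumerate(frequencies, start=1))


# Hypothetical input, not taken from the corpus.
example_endings = [
    "we left early",
    "they left early",
    "she stayed late",
    "he arrived",
]

for rank, freq in zipf_rank_frequency(example_endings):
    print(rank, freq)
```

Printing one rank and one frequency per line gives a two-column file that a plotting tool can draw on logarithmic axes, where a Zipf-like distribution appears as a roughly straight line.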

1414 msec needed at 2021-09-16 16:01